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One of the most important requirements for a detector at the ILC is good jet energy resolution. 
It is widely believed that the particle flow approach to calorimetry is the key to achieving the goal 
of 0.3/ y/E(GeV). This paper describes the current performance of the PandoraPFA particle flow 
algorithm. For 45 GeV jets in the Tesla TDR detector concept, the ILC jet energy resolution goal 
is reached. At higher energies the jet e nergy res olution becomes worse and can be described by the 
empirical expression: a E /E w 0.265/ y/E(GeV) + 1.2 x 10~ 4 £;(GeV). 
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I. INTRODUCTION 



-a 

Many of the interesting physics processes at the ILC will be characterised by multi-jet final states, often accompa- 
nied by charged leptons and/or missing transverse energy associated with neutrinos or the lightest super-symmetric 
particles. The reconstruction of the invariant masses of two or more jets will provide a powerful tool for event 
& reconstruction and identification. Unlike at LEP, where kinematic fitting^ enabled precise jet-jet invariant mass 
reconstruction almost independent of the jet energy resolution, at the ILC this mass reconstruction will rely on the 
detector having excellent jet energy resolution. The ILC goal is to achieve a mass resolution for W — > q 'q and Z — > qq 



decays which is comparable to their natural widths, i.e. ^2 GeV. A jet energy resolution of <te/E — a/ y/E(GeV) leads 



O ■ to a di-jet mass resolution of roughly a m /m — a/ y / Ejj(GeV), where Ejj is the energy of the di-jet system. At the ILC 
typical di-jet energies will be in the range 150- 350 GeV, suggesting the goal of a E /E = .3 / y/ E (GeV) . This is more 
^—1 ' than a factor two better than the best jet energy resolution achieved at LEP, <7e/E — 0.6(1 + | cos#|)/ yj E(GeV) 2\. 
J> . Meeting the jet energy resolution goal is a major factor in the overall design of a detector for the ILC. 
t-H ■ 

\o ; 

. II. THE PARTICLE FLOW APPROACH TO CALORIMETRY 

l> ■ 

o ■ 

\Q ' It is widely believed that the most promising strategy for achieving a jet energy resolution of cfe/E = 
0.30/^E(GeV) at the ILC is the particle flow analysis (PFA) approach to calorimetry In contrast to a purely 
calorimetric measurement, particle flow requires the reconstruction of the four- vectors of all visible particles in an 
event. The reconstructed jet energy is the sum of the energies of the individual particles. The momenta of charged 
r/3 ■ particles are measured in the tracking detectors, while the energy measurements for photons and neutral hadrons 
is performed with the calorimetric system. The crucial step of the particle flow algorithm is to assign the correct 
. calorimeter hits to reconstructed particles, requiring efficient separation of nearby showers. 

Measurements of jet fragmentation at LEP have provided detailed information on the particle composition of jets 
(e.g. 0,9) On average, after the decay of short-lived particles, roughly 62% of the energy of jets is carried by 
charged particles (mainly hadrons), around 27% by photons, about 10% by long-lived neutral hadrons (e.g. n/K"), 



and around 1.5% by neutrinos. Assuming calorimeter resolutions of <je/E = 0.15/ \/E(GeV) for photons and ue/E = 
& \ 0.55y/ E(GeV) for hadrons, a jet energy resolution of 0.19/^/S(GeV) is obtained with the contributions from tracks, 
photons and neutral hadrons shown in Tab. [I] In practice it is not possible to reach this level of performance for 
two main reasons. Firstly, particles travelling at small angles to the beam axis will not be detected. Secondly, 
and more importantly, it is not possible to perfectly associate all energy deposits with the correct particles. For 
example, if a photon is not resolved from a charged hadron shower, the photon energy is not counted. Similarly, if 
some of the energy from a charged hadron is identified as a separate cluster the energy is effectively double-counted. 
This confusion degrades particle flow performance. The crucial aspect of particle flow is the ability to correctly 
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assign calorimeter energy deposits to the correct reconstructed particles. This places stringent requirements on the 
granularity of electromagnetic and hadron calorimeters. Consequently, particle flow performance is one of the main 
factors driving the overall ILC detector design. It should be noted that the jet energy resolution obtained for a 
particular detector concept is the combination of the intrinsic detector performance and the performance of the PFA 
software. 

Component Detector Energy Fraction Energy Res. Jet Energy Res. 

Charged Particles Tracker ~0.6£j o t 1CT 4 E 2 X ± < 3.6 x 1CP 5 Ef ct 

Photons (7) ECAL ~0.3£j c t 0.15 j% 0.08 </E^ 

Neutral Hadrons (h°) HCAL ~0.lE jet 0.55 y/E^ 0.17 ■s/B^ 

TABLE I: Contributions from the different particle components to the jet-energy resolution (all energies in GeV). The table 
lists the approximate fractions of charged particles, photons and neutral hadrons in a jet and the assumed single particle energy 
resolution. 



III. THE PANDORAPFA PARTICLE FLOW ALGORITHM 

PandoraPFA 5] is a C++ implementation of a PFA algorithm running in the Marlin^, 0] framework. It 
was designed to be sufficiently generic for ILC detector optimisation studies and was developed and optimised using 
events generated with the Mokka 8] program, which provides a GEANT4[9j simulation of the Tesla TDR[lO| detector 
concept. The PandoraPFA algorithm performs both calorimeter clustering and particle flow in a single stage. The 
algorithm has six main stages: 

i) Tracking: for the studies presented in this paper, the track pattern recognition is performed using Monte Carlo 
information 6]. The track parameters are then extracted using a helical fit. The projections of tracks onto the front 
face of the electromagnetic calorimeter are calculated using helical fits (which do not take into account energy loss 
along the track). Neutral particle decays resulting in two charged particle tracks (V°s) are identified by searching for 
pairs of tracks which do not originate from the interaction point and that are consistent with coming from a single 
point in space. Kinked tracks from charged particle decays to a single charged particle and a number of neutrals 
are also identified. When a kink is identified the parent track is usually removed for the purposes of forming the 
reconstructed particles. 

ii) Calorimeter Hit Selection and Ordering: isolated hits, defined on the basis of proximity to other hits, are 
removed from the initial clustering stage. The remaining hits are ordered into pseudo-layers which follow the detector 
geometry so that particles propagating outward from the interaction region will cross successive pseudo-layers. In 
most of the calorimeter the pseudo-layers follow the physical layers of the calorimeters except in the barrel-endcap 
overlap region and where the ECAL stave structure [l(J] results in low numbered layers which are far from the front 
face of the calorimeter. The assignment of hits to pseudo-layers removes the dependence of the algorithm on the 
explicit detector geometry whilst following the actual geometry as closely as possible. Within each pseudo-layer hits 
are ordered by decreasing energy. 

iii) Clustering: the main clustering algorithm is a forward projective method working from innermost to outermost 
pseudo-layer. In this manner hits are added to clusters or are used to seed new clusters. Throughout the clustering 
algorithm clusters are assigned a direction (or directions) in which they are growing. The algorithm starts by seeding 
clusters using the projections of reconstructed tracks onto the front face of the calorimeter. The initial direction of a 
track-seeded cluster is obtained from the track direction. The hits in each subsequent pseudo-layer are then looped 
over. Each hit, i, is compared to each clustered hit, j, in the previous layer. The vector displacement, r^, is calculated 
and is used to calculate the parallel and perpendicular displacement of the hit with respect to the unit vector(s) u 
decribing the cluster propagation direction(s), d\\ — r^ .u and d± — |r^ xu|. Associations are made using a cone-cut, 
dj_ < du tana + /3Z? pac j, where a is the cone half-angle, -D pa d is the size of a sensor pixel in the layer being considered, 
and (3 is the number of pixels added to the cone radius. Different values of a and f3 are used for the ECAL and HCAL 
with the default values set to {tan cue = 0.3, @e — 1-5}, and {tan an = 0.5, /?h = 2.5} respectively. Associations may 
be made with hits in the previous 3 layers. If no association is made, the hit is used to seed a new cluster. This 
procedure is repeated sequentially for the hits in each pseudo-layer (working outward from ECAL front-face). 

iv) Topological Cluster Merging: by design the initial clustering errs on the side of splitting up true clusters 
rather than clustering energy deposits from more than one particle. The next stage of the algorithm is to merge 
clusters from tracks and hadronic showers which show clear topological signatures of being associated. A number 
of track-like and shower-like topologies are searched for including looping minimum ionising tracks, back-scattered 
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FIG. 1: a) PandoraPFA reconstruction of a 100 GeV jet in the Mokka simulation of the Tesla TDR detector, b) The total 
reconstructed energy from reconstructed PFOs in Z — > uds events for initial quark directions within the polar angle acceptance 
cos# q q| < 0.8. The solid line shows a Gaussian fit to the peak region with a standard deviation of 2.9 GeV. 

tracks and showers associated with a hadronic interaction. Before clusters are merged, a simple cut-based photon 
identification procedure is applied. The cluster merging algorithms are only applied to clusters which have not been 
identified as photons. 

v) Statistical Re-clustering: The previous four stages of the algorithm were found to perform well for 50 GeV jets. 
However, at higher energies the performance degrades rapidly due to the increasing overlap between hadronic showers 
from different particles. To address this, temporary associations of tracks with reconstructed calorimeter clusters are 
made. If the track momentum is incompatible with the energy of the associated cluster re-clustering is performed. 
If -Ecal — £track > 3.5<te, where oe is the energy resolution of the cluster, the clustering algorithm, described in 
Hi) and iv) above, is reapplied to the hits in that cluster. This is repeated, using successively smaller values of the 
as and /3s in the clustering finding algorithm (stage Hi)) until the cluster splits to give an acceptable track-cluster 
energy match. Similarly, if £track — Eqal > 3.5cte the algorithm attempts to merge additional clusters with the 
cluster associated with the track. In doing so high energy clusters may be split as above. 

vi) Formation of Particle Flow Objects: The final stage of the algorithm is to create Particle Flow Objects (PFOs) 
from the results of the clustering. Tracks are matched to clusters on the basis of the distance closest approach of the 
track projection into the first 10 layers of the calorimeter. If a hit is found within 50 mm of the track extrapolation 
an association is made. The reconstructed PFOs are written out in LciO 6] format. 

IV. PERFORMANCE 



Fig. [TJi) shows an example of a PandoraPFA reconstruction of a 100 GeV jet from a Z — + uu decay at s/s = 
200 GeV. The ability to track particles in the high granularity Tesla TDR calorimeter can be seen clearly. Fig. lb) 
shows the total PEA reconstructed energy for Z — ► uds events with |cos# q q| < 0.8, where qq - is the polar angle 
of the generated qq system. These events were generated at y/s = 91.2 GeV using the Tesla TDR detector model. 
The root-mean-square deviation from the mean (mis) of the distribution is 4.0 GeV. However, quoting the rms as 
a measure of the performance over-emphasises the importance of the tails. For example, in this figure, the central 
peak is well described by a Gaussian of width 2.9 GeV, equivalent to a r esolution of a E /E = 0.31/ v /^(GeV). In 
this paper two measures of the performance are quoted. The first measure, rmsgrj, is the rms in the smallest range 
of reconstructed energy which contains 90 % of the events. The second performance measure is obtained from a fit 
to the reconstructed energy distribution. The fit function is the sum of two Gaussian distributions with a common 
mean but different widths. The width of the narrower Gaussian, which is constrained to contain 75% of the events, 
gives a measure of the resolution in the peak, 075. For the data shown in Fig. ^i) both methods give a resolution 
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FIG. 2: a) The total reconstructed energy from reconstructed PFOs in Z — > uds at yfs = 360 GeV for initial quark directions 
within the polar angle acceptance | cos 9\ < 0.8. The solid line shows a results of the fit to two Gaussians and the dashed line 
indicates the contribution from the broader Gaussian which is constrained to contain 25 % of the events, b) The jet energy 
resolution, defined as the a in oe/E — ay/ E(GeV), plotted versus cos# qq - for four different values of y/s. 



of a E /E = 0.3/ v / J B(GcV); the ILC goal. However, this is of little consequence to ILC physics where, in general, the 
jets will be higher in energy. 

The majority of interesting ILC physics will consist of final states with at least six fermions, setting a "typical" 
energy scale for ILC jets as approximately 85 GeV and 170 GeV at y/s — 500 GeV and y/s =lTeV respectively. 
Fig. 01 shows the reconstructed total energy in Z -^uds events (generated without ISR or beamstrahlung effects) 
at y/s = 360 GeV. The fit to the sum of a double Gaussian gives 075 = 10.8 GeV, equivalent to a resolution of 
oe/E = 0.57 /yjE '(GeV), significantly worse than that obtained for lower energy jets. Fig. |21 shows the jet energy 
resolution for Z -^uds events plotted against |cos6* q q| for four different values of y/s. 



V. DISCUSSION 



The results described above are summarised in Tab. [H] The observed jet energy resolution in simulated events 
is not described by the expression oe/E — a/ y/E(GeV). This is not surprising, as the particle density increases 
it becomes harder to correctly associate the calorimetric energy deposits to the particles and the confusion term 
increases. Empirically it is found that the total energy resolutions in Tab. [n] can be described by a jet energy 
resolution of a E /E = 0.265/ y/E (GeV) + 1.2 x 10" 4 ^(GeV), where E is the jet energy. This expression represents the 
current performance of the PandoraPFA algorithm and should not be be considered as anything more fundamental. 
It should be noted that in the current Mokka simulation of the Tesla TDR detector the muon chambers are not 
included. In principle these can be used as a "tail-catcher" to improve the energy measurement for high energy 
hadronic showers which may not be fully contained in the HCAL. In the current version of PandoraPFA no attempt 
is made to correct for this energy leakage. It is noticeable in Fig.|2t> that the energy resolution improves with increasing 
polar angle in the barrel region of the detector, possibly due to increasing shower containment. 

Jet Energy rmsgo rmsgo / yj E(GeV) 0-75 0-75 / \J E(GeV) 



45 GeV 


2.8 GeV 


0.30 


2.8 GeV 


0.30 


100 GeV 


5.3 GeV 


0.38 


5.2 GeV 


0.37 


180 GeV 


11.0 GeV 


0.58 


10.8 GeV 


0.57 


250 GeV 


16.8 GeV 


0.76 


16.8 GeV 


0.75 



TABLE II: Jet energy resolution, expressed as both rmsgo and (J75, for Z — >uds events with |cos# q q| < 0.8. 
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VI. CONCLUSIONS 



Particle flow calorimetry is widely believed to be the key to reaching the ILC jet energy resolution goal oi <je/E = 
0.3/ \/E(GeV). Consequently, the design and optimisation of detectors for the ILC depends both on hardware and 
on sophisticated software reconstruction. For the Tesl a TDR detector concept, the PandoraPFA particle flow 
algorithm achieves good performance, < 0.4/ y/E(GeV), for jet energies upto about 100 GeV. For higher energies 
the performance degrades significantly reaching the equivalent of 0.6/ y/E(GeV) for 200 GeV jets. With further 
optimisation of the algorithm the performance is expected to improve. However, the current algorithm is adequate for 
most ILC physics studies at s/s = 500 GeV and may be used for the optimisation of the design of the ILC detector (s). 



[1] M. A. Thomson, Proc. of EPS-HEP 2003, Aachen. Topical Vol. of Eur. Phys. J. C Direct (2004). 
[2] ALEPH Collaboration, D. Buskulic et al, Nucl. Inst. Meth. A360 (1995) 481. 
[3] I.G. Knowles and G.D. Lafferty, J. Phys. G23 (1997) 731. 

[4] M. G. Green, S. L. Lloyd, P. N. Ratoff and D. R. Ward, "Electron-Positron Physics at the Z", IoP Publishing (1998). 
[5] http:/ /www.hep.phy.cam.ac.uk~thomson/pandoraPFA/ 
[6] http:/ /www-flc. desy.de/ilcsoft/ilcsoftware/ 

[7] O. Wendt, "Marlin and MarlinReco", to appear in Proc. of LCWS06, Bangalore, March 2006. 
[8] http : / / poly www. in2p3 .fr / activites /physique /geant4 / tesla /www / mokka / 
[9] GEANT4 collaboration, S. Agostinelli et al, Nucl. Instr. and Meth. A506 (2003) 3; 
GEANT4 collaboration, J. Allison et al, IEEE Trans. Nucl. Sci. 53 (2006) 1. 
[10] TESLA Technical Design Report, DESY 2001-011, ECFA 2001-2009 (2001). 



